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Siahl is an E3 ubiquitin ligase that contributes to proteasome-mediated 
degradation of multiple targets in key cellular processes and which shows 
promise as a therapeutic target in oncology. Structures of a truncated Siahl 
bound to peptide-based inhibitors have been reported. Here, new crystallization 
conditions have allowed the determination of a construct encompassing dual 
zinc-finger subdomains and substrate-binding domains at significantly higher 
resolution. Although the crystals appear isomorphous, two structures present 
distinct states in which the spatial orientation of one zinc-finger subdomain 
differs with respect to the rest of the dimeric protein. Such a difference, which 
is indicative of conformational freedom, infers potential biological relevance 
related to recognition of binding partners. The crystallization conditions and 
improved models of Siahl may aid future studies investigating Siahl-ligand 
complexes. 



PDB references: Siahl, 4c9z; 4ca1 
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Post-translational modification by ubiquitination controls many 
cellular processes, including the regulation of proteasome-mediated 
protein degradation (Grabbe et al, 2011; Kramer et al, 2013). 
Ubiquitination requires a cascade of enzymes, with the target or 
substrate specificity finally defined by the E3 ubiquitin ligase 
component, a noncatalytic component of the active complex that is 
formed with an E2 ligase. Siahl (seven-in-absentia homologue 1) is 
an E3 ligase belonging to the RING (really interesting new gene) 
domain family (Budhidarmo et al, 2012). It acts as a scaffolding 
protein and interacts with a variety of different substrates for 
ubiquitination and subsequent degradation (Qi et al, 2013). Other 
proteins regulate the activity of Siahl and some assemble together 
with Siahl into multi-component E3 ligase complexes. Siahl func- 
tions in Ras, DNA-damage and hypoxia signalling pathways, and is 
implicated in various cancers (House et al., 2009). Inhibition of Siahl 
activity impairs tumour growth and metastasis, suggesting that the 
protein is a potential therapeutic target (Wong & Moller, 2013). 

Siahl consists of an N-terminal RING domain, two zinc-finger 
subdomains and a C- terminal substrate-binding domain (SBD; 
Polekhina et al., 2002). The SBD is primarily responsible for dimer 
formation. A number of Siahl structures without the RING domain 
have been determined, with a highest resolution of 2.2 A (PDB entry 
2a25; Santelli et al., 2005). Other structures have lower resolutions of 
between 2.4 and 3.0 A (PDB entries lk2f, 2an6, 4i7b, 4i7c and 4i7d; 
Polekhina et al, 2002; House et al, 2006; Stebbins et al, 2013). 

In support of compound-screening studies and a structure-based 
approach to ligand development (Hunter, 2009), we sought crystals of 
Siahl that would allow us to validate potential hits. Here, we present 
two new crystallization conditions for Siahl which led to high- 
resolution structures. The resulting models contained a number of 
improvements, such as modelled flexible loop regions that were 
absent in earlier structures, and moreover despite being 'isomor- 
phous' revealed structural heterogeneity in the orientation of a zinc- 
finger subdomain. 
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2. Methods 

2.1. Crystallization and data collection 

A gene fragment encoding human Siahl without the RING 
domain (residues 91-282; Fig. 1; UniProt entry Q8IUQ4) was cloned 
and expressed and the product was purified as described previously 
(Rimsa et al, 2013). The protein was screened for crystallization 
conditions using the high-throughput Phoenix liquid-handling system 
(Art Robbins Instruments/Rigaku) with commercially available 
formulations. Crystallization occurred at room temperature within 
3 d in several conditions. Two of these conditions, No. 62 from 
The Classics Suite (Qiagen) and No. 86 from the Helsinki Random 
Screen 1 (Biocenter Finland), were refined using a hanging-drop 
vapour-diffusion method to yield diffraction-quality crystals. Opti- 
mized crystals grew from equal volumes of protein solution 
(15 mg ml" 1 in 15 mM Tris-HCl pH 7.5, 30 ml NaCl, 10 mM DTT) 
and two reservoirs. The first condition optimized to a reservoir 
(reservoir I) consisting of 100 mM MES pH 6.5, 1.5 M MgS0 4 , while 
crystals were subsequently obtained using reservoir II (100 mM 
HEPES pH 7.0, 1.45 M Li 2 S0 4 ). The crystals exhibited similar shapes 
(multi-faced prisms) and sizes (300 x 150 x 150 um) irrespective of 
the reservoir used. The crystals were cryoprotected by the addition of 
20%(v/v) glycerol to the appropriate reservoir prior to flash-cooling 
in liquid nitrogen and data collection at approximately 100 K. The 
diffraction properties of the first crystals obtained were characterized 
in-house and a data set was collected using a Rigaku MicroMax-007 
copper-anode X-ray generator coupled to an R-AXIS IV ++ dual 
image-plate detector. Subsequently, after the second crystallization 
condition had been identified a data set was collected using 
synchrotron radiation on beamline 103 at Diamond Light Source with 
a Pilatus 6M-F detector. 

2.2. Structure solution and refinement 

Diffraction data sets were indexed and integrated with iMosflm 
(Battye et al, 2011) and scaled using SCALA (Evans, 2006). The unit- 
cell parameters and symmetry suggested that the asymmetric unit 
consisted of a dimer with a V M of 2.99 A 3 Da -1 and about 60% bulk 
solvent. Structures were determined by molecular replacement using 
Phaser (McCoy et al, 2007). The first structure, determined to 1.95 A 
resolution, used the coordinates of a Siahl monomer (PDB entry 
2a25, chain A; Santelli et al, 2005) as a search model. The output 



model from molecular replacement identified the dimer previously 
observed and was subjected to a single cycle of rigid-body and 

Table 1 

Crystallographic statistics. 

Values in parentheses are for the highest resolution shell. 



PDB code 


4c9z 


4cal 


Space group 


7222 


7222 


Unit-cell parameters (A) 


a = 75.14, b = 104.59, 


a = 76.06, b = 104.12, 




c = 133.16 


c = 133.59 


Resolution (A) 


44.99-1.95 (2.06-1.95) 


52.06-1.58 (1.67-1.58) 


No. of reflections recorded 


141688 (11394) 


359712 (50840) 


Unique reflections 


38340 (5357) 


72738 (10523) 


Completeness (%) 


99.3 (96.0) 


100.0 (100.0) 


Multiplicity 


3.7 (2.1) 


4.9 (4.8) 


(I/cr(l)) 


19.3 (4.5) 


16.1 (3.5) 


Mosaicity (°) 


0.4 


0.5 


Wilson B factor (A 2 ) 


29.2 


19.1 


Rotation per frame (°) 


0.5 


0.2 


Total rotation range (°) 


100 


148 


Exposure per frame (s) 


600 


0.2 


Crystal-to-detector distance (mm) 


160.0 


268.8 


Wavelength (A) 


1.5418 


0.9763 


Residues 






Chain A 


91-282 


91-198, 202-282 


Chain B 


95-282 


91-197, 200-282 


No. of water molecules 


218 


331 


No. of Tris molecules 


2 


0 


No. of glycerol molecules 


3 


12 


No. of sulfate ions 


3 


5 


No. of Cl~ ions 


4 


3 


No. of Zn 2+ ions 


4 


4 


^merget (%) 


3.7 (17.8) 


5.2 (47.8) 


^workt (%) 


19.8 


14.1 


tffree§ (%) 


23.9 


18.9 


Average B factor for all atoms (A) 


18.7 


22.9 


Cruickshank DPI! (A) 


0.149 


0.073 


Ramachandran plottt 






Most favoured (%) 


96.9 


97.4 


Additional allowed (%) 


2.8 


2.6 


Outliers (%) 


0.3 


0.0 


R.m.s.d. on ideal values$t 






Bond length (A) 


0.01 


0.02 


Bond angle (°) 


1.32 


2.34 



t R m 



■ Thki E, Miihkl) - (I(hkl))\/j: hkl EJAhkl), where h{hkl) is the intensity of 



the zth measurement of reflection hkl and (I(hkl)) is the mean value of I^hkl) for all i 
measurements. X # work = J^hki \ \ F obs I - l^cak 1 1 /Em/ \ F obs I . where ^obs is the observed 
structure-factor amplitude and F calc is the structure-factor amplitude calculated from the 
model. § i? free is the same as 7^ work except calculated with a subset (5%) of data that 
were excluded from refinement calculations, Diffraction-component precision 
indicator (Cruickshank, 1999). ft Chen et al. (2010). tt Engh & Huber (1991). 
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Figure 1 

The primary and secondary structure of Siahl. The figure shows the full-length sequence of human Siahl (UniProt entry Q8IUQ4). The secondary structure presented 
corresponds to residues 91-282 of the 1.95 A resolution structure. The secondary-structure elements which are numbered make up the SBD. Residues that were modelled in 
the 1.95 A resolution structure but that were absent in the search model used in molecular replacement are shown on a yellow background. 
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restrained refinement using REFMACS (Murshudov et aL, 2011). All 
atoms were assigned a starting B factor derived from the Wilson plot 
statistics generated in SCALA. Electron-density and difference 
density maps were inspected and model extension and fitting were 
carried out in Coot (Emsley et aL, 2010). This was followed by further 
cycles of refinement and model fitting, which resulted in the inclusion 
of 30-35 residues at the N-terminal end and three internal regions 
which were absent from the search model (Fig. 1). Zinc ions, water 
molecules, other ligands and side-chain rotamers were included once 
the protein model had been completed. The presence of the Zn 2+ ions 
was suggested by the chemical environment, the electron density and 
from previous studies of zinc-finger proteins. We confirmed the 
assignment with a single-wavelength anomalous dispersion analysis 
using diffraction data recorded at the X-ray absorption edge of zinc 
(data not shown). The source of the zinc ions was presumed to be the 
broth used to culture the E. coli cells. Tight local noncrystallographic 
symmetry restraints were used in the early rounds of refinement, 
which were subsequently released. TLS (translation/libration/screw) 
parameters obtained from the TLSMD server (Painter & Merritt, 
2006) were applied in the later stages of refinement. The coordinates 
of this structure were subsequently used to solve the second structure 
of Siahl to 1.58 A resolution. The B factors were reset to the Wilson 
B value and refinement was carried out as described above but, with 
higher resolution data available, anisotropic temperature factors 
were refined. It was noted that significant changes to the model were 
required in the region of one N-terminal zinc-finger domain. Model 
geometry was monitored during refinement with MolProbity (Chen et 
aL, 2010). Structure superpositions were calculated using Secondary 
Structure Matching (Krissinel & Henrick, 2007) in Coot or LSQKAB 
(Kabsch, 1976), and the Protein Interfaces, Surfaces and Assemblies 
(PISA) server (Krissinel & Henrick, 2007) was used to characterize 
the dimer interface. Secondary structure was assigned using Coot and 
by inspection. Figures were prepared using PyMOL (DeLano, 2002) 
and ALINE (Bond & Schiittelkopf, 2009). Data-collection and 
refinement statistics are shown in Table 1. 



3. Results and discussion 

3.1. Quality of the structures 

Two isomorphous crystal structures of Siahl were determined at 
1.95 and 1.58 A resolution. There are two molecules per asymmetric 
unit, labelled chains A and B, and they form a dimer consistent both 
with the observation that a single species of approximate mass 
39 kDa was noted during size-exclusion chromatography purification 
and with published structures of Siahl (House et aL, 2003; Polekhina 
et aL, 2002; Santelli et aL, 2005; Stebbins et aL, 2013). Continuous well 
defined electron density is observed for the polypeptide backbone in 
the 1.95 A resolution structure, corresponding to residues 91-282 of 
the native Siahl sequence. Here, chain A also contains an additional 
methionine at the N-terminus, which is a cloning artifact. The 
electron-density map was very well defined in the 1.58 A resolution 
structure, apart from one loop region (Tyrl99-Gly201 in chain A and 
Lysl98-Tyrl99 in chain B), and these residues were excluded from 
the model. A Ramachandran plot shows that 97.4% of the residues in 
the high-resolution structure are located in the most favoured region, 
with no outliers. One outlier is present in the lower resolution 
structure and 96.9% of the amino acids are situated in the most 
favoured region. The diffraction-component precision indicator 
(DPI; Cruickshank, 1999) values (Table 1) are 0.073 and 0.149 A for 
the 1.58 and 1.95 A resolution structures, respectively. The DPI 
values for PDB entries 2a25, 4i7c, 4i7d and lk2f, as determined by the 



Electron Density Server (Kleywegt et aL, 2004), are 0.233, 0.470, 0.234 
and 0.186 A, respectively. 

3.2. Overall structure and comparisons with previous Siahl models 

Each Siahl subunit consists of two zinc-finger subdomains and a 
C- terminal substrate-binding domain (SBD; Fig. 2). The N-terminal 
zinc-finger motif belongs to the Cys 2 HisCys class of zinc fingers, while 
the second constitutes a member of the Cys 2 His 2 type. The SBD 
forms a dimerization interface with its counterpart from the partner 
subunit to create a Z-shaped structure (Fig. 3). The SBD consists of 



★ 




Figure 2 

Secondary-structure elements of the Siahl monomer. Ribbon diagram showing the 
SBD and two zinc-finger subdomains of the 1.58 A resolution structure. The 
a-helices and /3-strands are coloured purple and orange, respectively. The N- and 
C-terminal ends are labelled, as are the secondary-structure elements. Asterisks 
mark the positions of disordered loops. 




Figure 3 

Overall structure of the Siahl dimer. The figure shows the 1.58 A resolution 
structure. The zinc-finger subdomains are coloured red and blue, while the 
substrate-binding domains (SBDs) of monomers A and B are coloured green and 
orange, respectively. The N- and C-termini are labelled. The Zn 2+ ions are shown as 
grey spheres and the disordered loops are marked with asterisks. 
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two antiparallel /3-sheets of four strands each placed on top of one 
another. We use the established numbering scheme from previous 
work (Polekhina et ah, 2002). The first /3-sheet consists of strands 02, 
03, 04 and 07, while the second consists of strands 01, 08, 05 and 06. 
An additional strand 00 is parallel to 02, connecting the SBD to 
an adjacent zinc finger. Furthermore, the SBD contains three short 
a-helices (cd-a3) that link 04 to 05, 06 to 01, and 01 to 08. 

The total surface area of a monomer is about 10 800 A 2 . The buried 
area between the two subunits is 1100 A 2 ; therefore, approximately 
10% of the surface area is involved in dimerization. Siahl dimer- 
ization involves the C-terminal regions of the SBD and about a dozen 
hydrogen bonds and salt bridges. The majority of these inter- 
molecular contacts occur between residues on 06 of each subunit. For 
example, Arg232 and Arg233 of one monomer establish salt-bridge 
interactions with Asp255 and Glu237 of the partner subunit (Fig. 4). 
The equivalent arginine residues in the partner subunit make the 
same contacts with the residues in the first subunit. A further six 
hydrogen bonds are formed between the main chains of these two 
antiparallel 06 strands, involving Arg233, Thr235 and Glu237. In 
addition, the subunit-subunit interactions are mediated by a number 




Figure 4 

The dimer interface, depicting some of the residues forming the intermolecular 
hydrogen bonding. Monomers are coloured green and orange. Nitrogen and 
oxygen positions are coloured blue and red, respectively. 




Figure 5 

Overlay of the two Siahl structures. The 1.95 and 1.58 A resolution structures are 
shown in cyan and orange, respectively, with their associated Zn 2+ ions as grey 
spheres. The Zn 2+ ion in the N-terminal zinc-finger motif is displaced by 14.5 A 
between the two structures. 



of water molecules (data not shown). The extensive network of 
intermolecular contacts and the large buried interface suggest a tight 
association of the two monomers. 

The degree of NCS is more highly conserved in the high-resolution 
structure, in which the superposition of chain A on chain B {111 C a 
positions) results in an r.m.s.d. of 1.36 A. In the lower resolution 
structure the r.m.s.d. is 5.36 A (181 C a positions). We became aware 
of significant differences at an early stage of refinement and this 
indicated that the NCS restraints should be released. Inspection 
confirmed that the NCS breakdown is owing to differences in the 
orientation of the N-terminal zinc-finger subdomain with respect to 
the rest of the protein. Superposition of the two new structures on 
each other gives r.m.s.d. values of 0.80 and 4.89 A for chains A (175 
C a atoms) and B (164 C a atoms), respectively. The high value again 
occurs owing to a difference in the position of the N-terminal zinc- 
finger subdomain in the two structures (Fig. 5). The result is that the 
Zn 2+ ions are displaced by about 14.5 A when the structures are 
superposed. The overall molecular packing in each lattice remains 
well conserved and it is simply that there is space that allows the 
N-terminal zinc finger of the B chain to adopt a position where it can 
either interact with a molecule at 1 — x + \, y + \, — z + \ as in 
the highest resolution structure or interact with a molecule at 
1 — x, 1 — y, z as in the case of the lower resolution structure (Fig. 6). 
The space vacated is then occupied by bulk solvent. 

Superpositions with previously determined structures in the PDB 
identify the same feature. Superposition of Siahl chains excluding the 
N-terminal zinc fingers give lower r.m.s.d. values, typically in the 
range 0.5-1.0 A for about 135 C a atoms; for the few structures where 
the zinc-finger subdomain is included the r.m.s.d. values increase to 
between 2.6 and 4.8 A. The notable conformational flexibility of the 
N-terminal zinc-finger subdomain may contribute to its role in the 
biological function by adapting its position to assist in interactions 
with distinct substrate proteins. Further work would be required to 
address such a hypothesis. 

3.3. Siahl interactions with other proteins and mode of inhibition 

Siahl interacts with a variety of protein targets as well as the 
binding partners that control the contributions to ubiquitination. For 




Figure 6 

The crystal-packing environment of the N-terminal zinc finger of the B chain of 
Siahl. The overlaid 1.95 and 1.58 A resolution structures are coloured cyan and 
orange, respectively. Three symmetry-related molecules of the 1.95 A resolution 
structure are coloured red (1 — x, 1 — y, z), blue (1 — x + \, y + \, — z + |) and black 
(x + |, — y + |, — z + |). The Zn 2+ ions are shown as grey spheres. 
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example, Siahl alone is able to polyubiquitinate /3-catenin, a key 
component of the Wnt [the name is a combination of the homologous 
genes Wg (wingless) and Int] signalling pathway controlling cell fate 
and proliferation (Dimitrova et al, 2010). However, this process is 
much more efficient when Siahl forms an E3 ligase complex with SIP 
(Siah-interacting protein), the adaptor protein Skpl and the F-box 
protein Ebi (Matsuzawa & Reed, 2001). Many Siahl -interacting 
proteins carry a conserved binding motif VxP, where x is any residue 
(House et al., 2003). The motif is present in SIP and its crucial role in 
facilitating interaction of SIP with Siahl has been shown in a number 
of biophysical studies (Santelli et al, 2005; Bhattacharya et al., 2005). 
As an aside, we note that full-length human SIP was expressed and 
purified for structural studies in complex with Siahl. The interaction 
between the two proteins, with one-to-one stoichiometry (one Siahl 
dimer and two SIP molecules), was confirmed by size-exclusion gel 
chromatography and the complex was purified, but crystallization 
experiments have so far failed to yield any positive results. 

The currently available inhibitors of Siahl are short peptides that 
contain the VxP motif in their sequences (House et al., 2003; Stebbins 
et al., 2013). These peptides have been shown to reduce tumour 
growth and metastasis in a number of model systems (Moller et al., 
2009; Qi et al, 2010). Furthermore, they provide good starting points 
for the design of novel Siahl inhibitors, and a number of covalent 
peptide-based inhibitors with improved affinities have recently been 
reported (Stebbins et al., 2013). Access to crystallization conditions 
that lead to high-resolution diffraction data as reported here might 
in future assist in the structural characterization of novel inhibitors in 
support of early-stage drug discovery. 

This work was funded by a Biotechnology and Biological Sciences 
Research Council and AstraZeneca studentship and the Wellcome 
Trust (grants 082596, 094090 and 100476). Diamond Light Source 
provided synchrotron beam time and we thank the staff for support. 
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